An improved Fuzzy Clustering methodology applied to the study of Protein Conformational Ensembles

نویسندگان

  • Duhu Man
  • Isabel Maria Timon-Perez
  • Jesús A. Soto
  • Antonio Flores-Sintas
  • José M. Cecilia
  • Horacio Emilio Pérez Sánchez
چکیده

Clustering is a technique that aims to group data objects. Various similarity measures such as Euclidean, city-block, Mahalanobis distances and coisine similarity [1] have been used for discovering the underlying structures in data. Formally, the problem of clustering may be described as follows: Given a set of data objects X = {x1, x2, . . . , xn}, a clustering algorithm determines a suitable number k of homogeneous groups, and maps the data points to the labels in the set C = {1, 2, . . . , k}, where each label identifies a homogeneous group of objects. A good clustering algorithm should have the following characteristics: it should be scalable, i.e., perform well on data sets having large number of objects and also large number of attributes, should be able to determine clusters of varying shape and size, should have least requirement of domain knowledge (e.g., number of clusters, thresholds, termination condition parameters), should work well in the presence of noise and outliers, and should be insensitive to order of objects [2]. We present in this work an improved fuzzy clustering algorithm [3] that fulfills those criteria and we show its application for the classification of protein conformational ensembles, problem that arises in many domains of structural bioinformatics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Water Quality Zoning of Rivers by the Technique of Fuzzy Clustering Analysis

Zoning the pollution of a river may be the first or even the most important step in water quality management. In order to resolve its pollution, fuzzy clustering analysis may be used whenever a composite classification of water quality incorporates mutiple parameters&#10 &#10In such cases, the technique may be used as a complement or an alternative to comprehensive assessment. In fuzzy cluster...

متن کامل

Water Quality Zoning of Rivers by the Technique of Fuzzy Clustering Analysis

Zoning the pollution of a river may be the first or even the most important step in water quality management. In order to resolve its pollution, fuzzy clustering analysis may be used whenever a composite classification of water quality incorporates mutiple parameters In such cases, the technique may be used as a complement or an alternative to comprehensive assessment. In fuzzy clustering ...

متن کامل

OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM

This paper presents an efficient hybrid method, namely fuzzy particleswarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzyclustering problem, especially for large sizes. When the problem becomes large, theFCM algorithm may result in uneven distribution of data, making it difficult to findan optimal solution in reasonable amount of time. The PSO algorithm does find ago...

متن کامل

New Approach for Customer Clustering by Integrating the LRFM Model and Fuzzy Inference System

This study aimed at providing a systematic method to analyze the characteristics of customers’ purchasing behavior in order to improve the performance of customer relationship management system. For this purpose, the improved model of LRFM (including Length, Recency, Frequency, and Monetary indices) was utilized which is now a more common model than the basic RFM model apt for analyzing the cus...

متن کامل

A Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm

Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014